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1. Introduction 

Let E ff be a compact surface of genus g > 2. Let Q g be the moduli space of 
unit-area holomorphic quadratic differentials on S 9 . That is, a point q £ Q g is a 
equivalence class of pairs (M,ui), where M is a genus g Riemann surface, and w 
is a holomorphic quadratic differential on M, i.e., a tensor with the form f(z)dz 2 
in local coordinates, such that J M \u>\ = 1. Two pairs (Mi,u)i) and (M2,W2) are 
equivalent if there is a biholomorphism / : M\ — > M2 such that f*u>i = u>2- 

Given a pair q £ Q g , one obtains (via integration of the form) an atlas of charts 
to C = R 2 , with transition maps of the form z 1— » ±z + c. Similarly, given such 
an atlas of charts, one obtains a holomorphic quadratic differential by pulling back 
the form dz 2 on C. 

These charts allow us to define a SL(2, R) action on Q g (and Q g ) given by linear 
post-composition with charts. 

Q g is naturally the unit cotangent bundle to M g , the moduli space of Riemann 
surfaces. The fiber over each point M £ A4 g is the vector space of holomorphic 
quadratic differentials on M. 

The space Q g is naturally stratified by integer partitions f3 of Ag — 4. Strata are 
not always connected, however, they have at most finitely many components 
and are invariant under SX(2,R). For the rest of our paper we work with one of 
these connected components, call it Q. 

Without loss of generality, we will study strata of squares of abelian differentials 
(holomorphic 1-forms). Otherwise, we pass to a double cover. Since our results 
hold for all g > 2, we simply consider the (higher-genus) stratum defined in this 
way. Each stratum Q, while non-compact, is endowed with a continuous, ergodic, 
SL(2, R)-invariant probability measure /iq. 

More details about the basics of moduli spaces and quadratic differentials can 
be found in, e.g., [H3 CHI ESI 

We are particularly interested in the action of the standard subgroups K (maxi- 
mal compact) and A (diagonal matrices) of SL(2, R). These are both one-parameter 
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subgroups, and can be described as follows: 
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We will often be interested also in the action of the semigroup A + = {g t : t > 0}. 

The action of K, known as the circle flow, preserves the underlying holomorphic 
structure, so it acts as identity when projected to A4 g . The action of A is known 
as Teichmuller geodesic flow, since the projection of an A-orbit yields a geodesic 
in the Teichmuller metric on M. g (and in fact, all Teichmuller geodesies arise this 
way). 

Masur ^7] and Veech [251 independently showed the Teichmuller geodesic flow 
is ergodic with respect to and even mixing. Veech showed that it was 
'measurably Anosov', and more recently Forni has obtained explicit formulas 
regarding the hyperbolic behavior. His results, which imply that as long as trajec- 
tories remain in a compact set their hyperbolicity can be controlled, provide much 
of the motivation for our research. 

Our main results are concerned with the following scenario. Fix q G Q, and 
consider the 'circle' Kq = {rgq : < 9 < 2ir}. This set is endowed with a natural 
probability measure v, coming from the Haar measure on K = S . What is the 
recurrence behavior of a trajectory {gtrgq} when 9 is chosen at random according 



This type of situation was first considered by Kcrckhoff-Masur-Smillie m 
order to understand dynamics of billiards in rational angle Euclidean polygons. 
They developed a dictionary between the dynamics of the straight-line flow on 
the singular Euclidean surface determined by a quadratic differential q and the 
recurrence behavior of the geodesic trajectory g t q in the stratum Q. The X-action 
here corresponds to changing the direction of the straight line flow without changing 
the underlying surface. Thus, making a statement about a fixed q and almost 
all directions allowed one to say something about straight line flow in almost all 
directions for a given flat surface, in analogy with Weyl's equidstribution theorem 
on the torus. 

To apply these results to billiards, one follows the unfolding procedure of Zeml- 
jakov and Katok |27j . which translate questions about dynamics of billiards in 
rational angle polygons to that of flows on an associated singular Euclidean sur- 
face. However, the set of surfaces arising from billiards is of measure in every 
stratum, so making statements about almost every point q does not suffice. To 
make statements about billiards, we need statements that hold for all q G Q. 

The main result in ^3 is that for all q G Q, and almost every 9 6 [0, 2ir), 
the geodesic trajectory {gtrgq}t>o is recurrent in Q. As a corollary, one obtains 
that the directional flow for Euclidean polygonal billiards is uniquely ergodic for 
almost every direction. Further results in this direction include estimates on the 
Hausdorff dimension of divergent and bounded trajectories (E1E1)> and further 
relations between dynamics of the straight line flow and recurrence of the associated 
trajectory @JE]. 

Our main results concern finer recurrence behavior of geodesic trajectories. In 
particular, for any fixed q G Q, we estimate the measure of the set of angles such 
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that the associated A + orbit {gtrgq}t>o 'behaves poorly' for a length of time T. 
For us, poor behavior means that the trajectory is spending a lot of time in a 
neighborhood of the cusp of Q. Masur ^8] proved a statistical result, known as a 
logarithm law in this situation, in analogy with earlier results on symmetric spaces 
due to Sullivan [22 and Kleinbock-Margulis [H| . 

We will construct a proper (that is, unbounded off compact sets) continuous 
function V : Q — > R + , and consider the recurrence behavior of {gtfgq}t>o to the 
family of compact sets C\ := {q : V(q) < I}, which form an exhaustion of Q as I 
varies. Our main results can be summarized as follows: 

Theorem 1.1. Fix notation as above. Then 

(1) For all I sufficiently large and all q £ C'i, there are positive constants c\ = 
C\(l, q), c 2 {l), with 

v{d : g t r e q £ C ; ,0 < t < T} < cie~ C2T 

for all T sufficiently large. That is, the probability that a random geodesic 
trajectory has not visited C\ by time T decays exponentially in T. 

(2) For all l,S,T sufficiently large and all q £ Q, there are positive constants 
c 3 = C3(S, I, q), C4 = ca{1), with 

v{6 : g t r e q £ Q, S < t < S + T} < c 3 e~ CiT . 

That is, the probability that a random geodesic trajectory does not enter Ci 
in the interval [S, S + T] decays exponentially in T. 

(3) Let q G Q. For any < A < 1, there is a I > 0, and < 7 < 1, such that 
for all T sufficiently large ( depending on all the above constants ) 

v{0 : ^|{0 < t < T : g t r e q £ d}\ > A} < 7 T 

Result (3) above may be thought of as a large deviations result for the Teich- 
muller flow. While ergodicity guarantees that ^ |{0 < t < T : g t q G C/}| — > Hq{Ci) 
for /iQ-almost every q G Q, our result gives explicit information about the likeli- 
hood of bad trajectories. Notice, however, this is not a traditional large deviations 
result, which estimates the probability of a deviation of any e > from the ergodic 
average. Other interesting results concerning deviations are due to Bufetov who 
proved a central limit theorem for this flow. 

In |11|. Forni related this type of fine recurrence behavior for the geodesic tra- 
jectory {gtq}t>o to deviation of ergodic averages for the straight line flow on the 
flat surface associated to q. He proved that for almost every q G Q, the associated 
flow has the same deviation behavior. However, his result gives no information on 
billiards, as the set of quadratic differentials arising from billiards are a set of mea- 
sure zero. Since our results provide fine recurrence information about the geodesic 
trajectory {gtfeq\t>o for oil q G Q, and almost all 9, we conjecture the following: 

Conjecture. For all rational- angle Euclidean polygons, the deviation of ergodic 
averages for the billiard flow is the same for almost all directions, and depends only 
on the SL(2,S>) orbit of the associated quadratic differential. 

It was also shown in that as long as a geodesic trajectory stays within 
a compact set, the rate of expansion/contraction in the tangent space along the 
trajectory is bounded away from 1. Thus our results can be used to obtain explicit 
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estimates on the hyperbolicity of the flow along specific trajectories. Avila-Gouezel- 
Yoccoz £Q used exponential return estimates to a different family of compact sets 
to prove the exponential rate of mixing for the Tcichmullcr geodesic flow, which 
was the original motivation for this research. 

Other quantitative recurrence results for dynamics on Tcichmullcr spaces were 
obtained by Minsky- Weiss [23] for the case of Teichmuller horocyclc flow. 

We also have a collection of results for a certain class of random walks, defined 
as follows: Fix t > 0. Given that we are at a point q G Q, the next point in our 
trajectory will be chosen at random according to Haar measure on S 1 from the 
'circle' of radius r,{g T rgq : < 9 < 2tt}. Note that hq is a stationary measure 
for this walk, since it is SL(2, K)-invariant. Let {X n }^ =0 denote the random walk 
generated this way. 

Remark: By hyperbolic geometry, one can see that trajectories of this walk closely 
approximate geodesies. Thus, understanding recurrence properties of the walk gives 
one insight into properties of the flow. 

We define P q {E) :— Prob(_E|Xo = q) for any event E defined on the trajec- 
tory {Xjij^g starting at Xq = q. Also, for any measurable A C Q. we define 
P n (q,A) := P q (X n G A). That is, P n (q, .) is the probability distribution of X n 
given X = q. 

Theorem 1.2. For all t sufficiently large, we have: 

(1) There is an l(r) such that for all I > 1(t), and all q ^ C\, there are constants 
C5 = C5(q, I, t), Cq = cg(Z,r), such that 

P q (Xi iQ:l<i<n)< c 5 e- CBn . 

(2) For all compact C C Q, e > 0, there is a Ci D C such that Vq G C , and for 
all m > 0, 

P q (X m G Ci) >l-e. 

(3) For all e > 0, there is a I = 1(e) > such that Vg £ Q, there is an M(q) 
such that for all m > M(q), 

P q (X m ed) > l-e. 

The rest of the paper is organized as follows: In the next section, we give more 
detailed statements of our main results, and construct the function V. We also 
give a version of our results for general SL(2, Reactions. In sectional we prove 
our results for random walks. In section^] we collect some technical lemmas about 
change of polar coordinates on hyperbolic space. In section we construct the 
required lemmas from the theory of large deviations. In section |B| we prove our 
main theorems for the flow. 

2. Statement of Results 

The precise statements for the flow regard a family of proper functions Vs, < 
S < 1, and the compact sets Cs,i '■= {q ■ Vs(q) < I}. We have: 

2.1. Flow results. 

Theorem 2.1. For every 1 > 6 > there is a proper (i.e., unbounded off compact 
sets), smooth, K -invariant function Vs : Q — > M + and positive constants to = 
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to(S),lo = lo{S),a = a(8) such that for all I > Iq, there are 1 > 8' = d'(l,S) > 8, 
with 8' decreasing as a function of I, so that for all q £ Cs,i 

u{9 : g t r e q $ C s , h < t < T} < a YM. e -£-S')T , 

for all T > to. 

This is the precise version of part (1) of Theorem 11.11 Note that the outside 
term essentially depends only on V$(q). 

Theorem 2.2. Let q G Q. For every e > 0, 1 > 5 > 0, there are positive constants 
Sq,Ti,Ii depending on 8, 1 > 8" = 5" (I) > 8, and a = a(q) such that 

v{6 : g t r e q £ C 5 , h S <t<S + T}< ae^ 1 - 5 "^, 

for all S > So, T > T\, and I > l\, with 

s oci j \ h Vs(gsreq) 
ol < 8(1 + e) sup ab ; , 

ee[o,27r) * 

where b depends only on the curvature of H 2 . 

Here, So depends only on the choice of curvature for the hyperbolic plane H 2 , and 
T\ is the maximum of to from Theorem 12.11 and a T2 depending only on curvature. 

Theorem 2.3. Let qeQ. For any < A < 1, and any < 8 < 1 there is a I > 0. 
and 7 < 1, such that 

v{e : ^|{0 <t<T: g t r e q £ C 5>1 }\ > A} < 7 T , 

for all T sufficiently large. 

This theorem uses a technical tool from the theory of large deviations, Proposi- 
tion [O] which makes it difficult to track the dependence of 7 on I and A. Clearly, 
if an I works for a fixed Ao and 8, it works for all A > Ao, and any larger / (with 
the same 8) will work for Ao. Similarly, the same / will work for the same Ao and 
any smaller 8. 

In a personal communication, Forni posed the following question: for every q 
in Q, and almost all 9, does there exist a ( > such that the geodesic trajectory 
{gtTeq}t>o spends at least a proportion £ of its time in a fixed compact set? The 
following corollary answers in the affirmative. 

Corollary 2.4. Let q e Q,( > 0. Fix e > 0. Let I be such that Theorem HOI is 
satisfied with A = 1 — £, 8 = e. Then for v-almost every 9, 

limsup - |{0 < t < T : g t r e q £ C S ,i}\ < A. 

The proof of this result follows from an application of the Borel-Cantelli lemma 
and Theorem 12.31 

2.2. Random walks on Q. Let {X„}^ be our trajectory. That is, 

Xn+i = 9ri"e n X n , 

where {9 n }^L Q are independent and identically distributed (i.i.d.) according to the 
uniform distribution on [0,27r) (equivalently, X n+ i = g T k n X n . where {fc n }^Lo are 
i.i.d. according to Haar measure on K). This implies is a Markov chain, with 

transition probability function P{x, A) = v{9 : g T req € A}. 
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Theorem 2.5. Fix < S < 1. Let {Cs.i} be as in Theorem \2.1\ Then there is a 
to = To(S) > such that for all r > to, there is a Iq = Iq(6,t) such that for all 
I > Iq, there is a 7 = 5) < 1 such that for all q C$ } i, and for all n > 1, 

P 9 (Xj g C,,, : < j < n) < YMy i - 

This is completely analagous to Theorem 12. II and indeed the proof of this result 
will be essential to the proof of Theorem 12.11 

For the rest of the section, fix 1 > 5 > 0, and r > t (5). 

Theorem 2.6. Let e > 0. Then: 

(1) For all compact C C Q, there is a C$ 1 D C such that \/q E C, and for all 
m > 0, 

P q (X m G Cfe,,) >l-e. 

(2) There is a I > smc/i £/iat G Q, </iere is an M(q) such that for all 
m > M(q), 

P q (X m e C S j) > 1-e. 

Finally, we have: 
Corollary 2.7. 

Fix q G Q, < A < 1, and e > 0. TTien there is a I = l(q, A, 5, e) > so that, for 
all n sufficiently large, 

P,(i|{l<i<»:A,6 > > 1 - e 

This is analogous to Corollary 12. 41 
Remark: I can be chosen uniformly as q varies over a compact set. 

2.3. SL(2, Reactions. Let S , L(2,R) act continuously on a topological space AT. 
Suppose there is a family of A'-invariant functions Vg : X — >K, 0<<5<1 satisfying 
the following properties: 

• For all ir G AT. consider the function Vs. x : SX(2,R) — ► M + defined by 

= (3a;). Note that by X-invariance, we can view V$ tX as a function on 
H 2 = SO(2)\SL(2, R). We require the following : For all a > 1, there exists 
a « > such that for any pel 2 with d(p, i) < ft, and any ft, G SL(2, R) 

(2.8) ff" 1 ^^) < V s , x (ph) < o-V s , x (h). 

We say such a function is logsmooth. 

• For all 1 > S > 0, there is a constant c = c(S) such that for all sufficiently 
large r, there is a 6 = 6(r, <5) such that for all x G X, 

r-2-n 

(2.9) (A T V s )(x):= V 5 {g T rex)dv{9) < ar^-^ T V s {x) +~b. 

Jo 

All our results on recurrence to the sublevel sets Cs,i from subsections !2 1 ll and !2 . 21 
while constructed for the space Q hold for any space X with the above properties. 

The key lemma needed to prove our results will be the construction of a family 
of functions {V$} on Q satisfying the above requirements. 

Lemma 2.10. There is a family of smooth, proper functions V$ : Q — > R + , < 
S < 1 satisfying equations (|2.8|l and l|2.9|l . 
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We require the following technical lemma from [H] (page 465, Lemma 7.5). 

Lemma. For all S > there are logsmooth functions Vq, ■ ■ ■ ,V n '■ Q — ► K + such 
that Vq is proper, and for every r > 0, there are constants w = w(r, S),b' = b'(r, 5), 
and d = c!(S), independent of r such that for all < i < n, and G Q, 

n 

(A T Vi)(q) < c'e-^-^Viiq) + w E Vj(q) + 9. 

j=i+i 

Remark: In fact Vo(q) = max fl, t^jitt \ where l(q) denotes the length of the 
shortest saddle connection on q. Recall that a saddle connection is a geodesic (in 
the metric determined by q) connecting two zeroes of q. 

Proof of Lemma 12.101 

Fix 5, t > 0. Let c = 2c' A = f , Xi = + l)* , for < i < n. Note that 

A, < 

2w 



V Xi < — A,-. 

^ - On, 3 



i=0 

Set b = V Y2=o A - Let V s (q) = Eto X * V M- Then 



(A T Vs) (q) = E Aj (A T K:) (?) < E M ^ _(1 " S)T ^(«) + w E ^(«) + 9 

i=0 i=0 \ i=i+l 

= E Aic'e-^-^F^g) + ™E E A * W + 6 

i=0 3=1 \i=l / 

n I 3-1 N 

= cV^-^AoVoO?) + E^(9) ( Ve-'^ + ifl^Ai 
'jp-( 



c e 



1 - i)T A Vb(g) + E 2A,c' e -a-*>^ .(g) + 6 

3=1 

< ce-^ T F 5 (g) + 6. 

Thus we have constructed a family of functions Vs satisfying equations (|2.8() 
and (|2.9|) . That V5 satisfies <|2.8|) follows from the logsmoothness of the ViS noted 
in [5j (where they are called oVs) on page 471, at the beginning of the proof of 
Proposition 7.2. Finally, Vs is proper since Vq is proper. □ 



3. General Markov Chain results 

Our goal in this section is to recall some results from the theory of Markov Chains 
which will allow us to prove Theorems 12. 512.61 We also give some applications to 
random walks on homogeneous spaces, as considered in 0. Many of these results 
can be found, in greater generality, in [2"Tj . 

Before stating our main results, we recall some basic notation: If {X n } is a 
Markov chain on (S,S), S the state space, and S the c-algebra, then, for any 
event E (an event E is a set in the product u-algebra) and any starting point 
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x £ S , P X {E) := P(E\Xq = x), i.e., it is the probability of the event E occuring 
given that our starting point was x. Similarly, given a measurable subset C 6 S, 
we write P n (x,C) = P x (X n £ C), and we define the first hitting time of C by 
t c := inf{n > 1 : X n e C}. 

The main result of this section is: 
Proposition 3.1. Let S be a non-compact topological space, and S its Borel o~- 
algebra. Let {X n }^ =0 be a Markov chain on (S,S). Suppose there exists a smooth, 
proper function V : S — > K + and constants < c < 1 and b > such that 

(PV)(x) := E{V{X 1 )\X = x) < cV(x) + b, 

for all x £ S. Then 

(1) For anyl>0 and x <£ Q := {y £ S : V(y) < I}, 

/nt,/ \ V(x) ( b 

Pn(x) := P x (t Ci >n)< — ^ ( c + - 

for all n > 0. 

(2) For all e > 0, and for all compact C C S , there is an I such that Vx £ 
C,m > 0, 

P m (x,Q) >l-e. 

(3) For all e > i/iere is an I > such that Vx £ S 1 , i/iere is an A/(x) > such 
that for m > M(x), 

P m (x,Q) >l-e. 

(4) For allx£S,0<X<l, and e > 0, £/iere is a I — l(q, A, e) > so </ia£, /or 
all n sufficiently large, 

P x f i|{l < i < n : Xi e Ci}\ > Xj > 1 - e. 

If we fix A and e, t/ien ^ can 6e chosen uniformly as x varies over a compact 
set. 

Proof of Theorems 12. 5ll2.til and Corollary 12. 71 : Combine Proposition ^. ll with 
Lemma ETTUl For Theorem 12*31 fix 5 > and let r be such that c = ce~ ( * 1 ~ s "> r ° < 1, 
we obtain our result with 7 = ( c + j J for I > . □ 



;y l-c' 
Proof of Proposition |3"TT1 

• Proof of (1): Let B n := {t Ci > n}. Then p n (x) = P x (B n ). For n > 0, 

Ipn < E x (V(X n ) : B n ) =: D n 

since on B„, X n £ Ci, i.e. > I. 

Now, £?„ G B n _i, so 

£>„ < £? X (^(JT„) : B n -i) 
(3.2) = E x {E{V{X n )\X n -i) : S n _ x ) 

= £ a ((Py)(X n _i) :B„_i) 

where we are using the Markov property in the 2nd line. 
Now we can apply our condition 

(PV0(X n _i)<cV(X n _i) + &. 
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This yields 

D n < c.D„_i + bpn-t, 

and using the observation that p n _i < , we obtain the recurrence 

relation 

D n < ( C +y) D n-1- 



Iterating this, we obtain 

D n < D (c + j 

Since Dq = V(Xq) = V(x), and p n < ^f-, wc obtain our result. Note that 
the result is only meaningful if (c + y) < 1, which is equivalent to setting 

1 — c 

• Proof of (2): We have (PV)(x) < cV(X) + b for c < 1. Iterating this, we 
get that (P m V)(x) < c m V(x) + b\ where b' does not depend on m or x. 
Set I = sup eC v \ y >+ b , Then we have that 

lP m (x,C{) < E x (V(X m j) < c m V(x) + b', 
so we get that P m (x, Cf ) < e as desired. □ 

• Proof of (3): For the third property, we select 1 = 2b' /e, where b 1 is as 
above. For m sufficiently large, c m V(x) < 6', so by the argument in part 
(2), we can get our conclusion. □ 

• Proof of (4): Fix x E S, < A < 1, and e > 0. By part (3), there 
is an I > so that for n sufficiently large, P(X n € C;) > 1 — e', where 
< e' < |e(l — A). Set S n = i Y^i=i XCi(^i), where xc t is the indicator 
function of C/. Then, for any A < 1, 

E(S n ) < XP(S n < A) + P(S n > A) 
= A + (1 - X)P(S n > A). 

Thus, we have 

E(S n ) - A 



P(S n > A) > 



1 - A 



Now, for n sufficiently large E{S n ) > 1 — |e', thus, 

1 — -e' - A 

P{S n > A) > ^ > 1 - e. 

The fact that I can be chosen uniformly as x varies over a compact set 
follows from part (2). □ 

These types of questions were considered for random walks on homogeneous 
spaces in |7] by Eskin and Margulis. They constructed a function V on their state 
space satisfying the conditions of ProDOsition l3.ll and used this to draw conclusions 
(2) and (3). Conclusions (1) and (4) appear to be new results for these walks. 

Precisely, we have the following: 
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Theorem. Let G be a semisimple Lie group, and T a non-uniform lattice. Let 
fi be a probability measure on G satisfying the conditions of Theorem 2.1 in 
Consider the Markov chain {X n }^ =0 defined on G/T by the measure /i: 

X n +i = g n X n , 

with {g n }%Lo an i.i.d. (with distribution \x) seguence of elements of G. Then there 
is a function V : G/T — > R + satisfying the conditions of Proposition V3.1X Thus, 
conclusions (l)-(4) of the Proposition are satisfied. 



4. Polar coordinates and shadowing 

We require two lemmas about change of polar coordinates in the hyperbolic 
plane H 2 = SO(2)\SL(2, R). We fix two positive numbers ti,t 2 , and bascpoints 
i and zq = i.gt t rg (these will correspond to our bascpoint q and an arbitrary Qq 
in its SL(2, R)-orbit. projected to SO(2)\Q). We let d(.,.) denote distance in the 
hyperbolic plane. 

Consider the circle of radius t 2 around zq, defined by {z^, = i.g t2 r^g tl rg : < <f> < 
27r}. We say that t 2 , <fi are the polar coordinates of z$ based at zq. For each cf>, we 
define D = D tlM {4>) and * = ^ tl ,t 2 {(f>) by z = i.g D (<f,)re+y(<i>), i-e. £>(</>), + 
are the polar coordinates of z^ based at i. Note that D, ^ are independent of 9. 

Geometrically, D(<fi) is the distance and ^(<f>) is the angle (measured clockwise 
from the the geodesic connecting i to zq) of the geodesic segment connecting i to 
z,)). Hyperbolic trigonometry (the laws of sines and cosines, appied to the triangle 
formed by the points i, zq, and z§) yield: 

(4.1) coshD((/>) = coshii cosh£2 + sinhii sinhi2 cos</>, 
and 

, sinhto 

(4.2) Smf (^^i S ^- 

If t 2 > iij i lies inside the circle of radius r around z , and thus, the map ^ is both 
one-to-one and onto. If ti > t 2l the point i is outside the cirlce, and \I> is neither 
one-to-one or onto. In this case, the image is an interval, with boundary points 
such that the geodesic determined by those angles intersects the circle of radius t 
tangentially. In the interior of the interval, each point ip has two preimages, call 
them (f>i, (f>2 one such that D(cf>i) f=a ti — t 2 , and one such that D(<fi 2 ) « ti + t 2 . For 
our applications, we will only be concerned with <p 2 . 

The key technical lemma is as follows: 

Lemma 4.3. Let A C [0, 2tt) be a measurable set. Then, for every e > 0, there are 
ti,T2, such that for all t\ > T\,t 2 > t 2 the neighborhood U = ^t t .t 2 ([ — 7r /2, tt/2]) of 
satisfies 

p(Un$(A)) 



< 4(l + e)v(A). 



Proof: For this estimate, we need to control the behavior of the derivative V?'. 
More precisely, we need to control ratios <3/' '(4>i) '(fo), with 4>i,(f> 2 G ^ _1 ?7, so 
we can compare v and ^*v, where ^>^f(E) = i>(if?~ 1 E). 
We have the following claim: 
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Claim. Let 77 > 0. For t\,ti sufficiently large, 

^(1-7?) < |*'(0)|< e -^(l+^), 

for (f> e [— 7r/2,7r/2]. 

Proof of Claim: 

Implicit differentiation of equations (|4.1J) and (|4.2(l yield: 

(4.4) £>'(</>) sinhD(<?!>) = - sinhii sinh< 2 sin0 
and 

. , . , , cosd>smhD(d)) + sin 2 6cothD((b) sinhii sinhto 

(4.5) tf'(0)cos*(0) =sinht 2 ^ . , 2na , • 

smn D(q>) 

Let k > 0. Let be large enough so that for all <j> € [— 7r/2, 7r/2], 

(1) D(<f>) > h + h - «, 

(2) coth(D(</>)) > 1 - k 

(3) cos(#(<£)) > 1 - « 

(4) I sinhti sinht 2 — sinh £>(</>) | < ft 

(5) i_ K< l^i j i^h k<1 + Kforz = lj2 _ 

That we can achieve the above inequalities follows from hyperbolic geometry and 
the basic properties of sinh and cosh. 

Let 77 > 0. Using the above inequalities and some basic algebra, we can select 
Ti, t 2 such that for all ti > r 1; i 2 > t 2 , we have 

v V - e_^( 2c os^ + sin 2 ( /)) - 

Remark: The expression ^— !-(2cos(/> + sin 2 <fi) is obtained by replacing the quan- 
tities in equation l|4.5|l with their approximations (l)-(5). 

Now, let f(4>) = 2cos</> + sin 2 0. We have 1 < f(4>) < 2 for 4> e [-tt/2, tt/2]. 
Thus, 

^J-V-V) (l + trte-* 1 , 

completing the proof of the claim. 

To complete the proof of the lemma, let e > and rj be such that < 1 + e. By 
the claim, we know that the proportion of measure of any set in [— 7r/2, 7r/2] cannot 
be expanded by more than 2j^- < 2(1 + e) under \t, since that is the maximum of 

|^|4 for ^,02 6 [-tt/2, tt/2]. Now, since i/([-7r/2, tt/2]) = 1/2, we have 

v(An[-«/2,«/2]) 

1/(1-^/2^/2]) " 1 ; ' 

which yields 

«p< !(1+e ).MA) = «( 1+ +(4 

□ 
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Our second main lemma is as follows: let r > 0. For any t > 0, define 
I(t) := {0 < 9 < 2ir : d(i.g u i.g t r e ) < 3r}. 



Fix k > 0. For any 9 G I(t), define 

Ug := {0 < < 2tt : D tiT (0) > 1 - k, (*t lT (0) + 0) £ /(< + r)}. 

Let = v(Ug). The following lemma is proved in [H]. For notational convenience, 
we write drop the subscripts for T and S. 

Lemma 4.6. There is a constant c" > such that for all n > 0, t/iere is a r > 
swc/i i/iai /or a/Z G 7(i), ffte map ^lig is a diffeomorphism onto its image, and, 
making the subsitution tp = ^ ((/)), we have 



Proof: [8], page 467, Lemma 7.6. 
Remarks: 

• d9 denotes dv(9), and since we have normalized v to be a probability mea- 
sure, we do not need to divide by 2ir. 

• In [S], the sets are defined by J(t) := [— pe - *, pe _t ] for some positive 
constant p, and they require p to be large enough so that the diameter of 
the set J(t) — {i.gtre ■ 9 G lit)} is at least 2r. By hyperbolic geometry, our 
sets 7(t) are of this form, and they obviously satisfy the required condition. 

We have the following Corollary, also from 

Corollary 4.7. Let f : 5X(2,R) — ► R 6e a logsmooth SO (2) -invariant function. 
Fix a > 1. Lei k > be as in equation (|2.8() . Fia; t so i/iai Lemma \4.6] holds. Then 
there is a d > 0, independent of t, such that 



In this section, we prove the key technical lemma for our main large deviations 
result Theorem O We assume some familiarity with the theory of conditional 
expectation. Excellent references include [2*4ll6*]. 

Proposition 5.1. Let {t~;}°^ be a sequence of positive real-valued random variables 
on a probability space (fl,J-,P). Let be af filtration of T such that for all i, 

T{ G Ti, i.e., Tj is Ti-measurable. Suppose there exist positive random variables 77, £ 
with En < E£, Et; > 0, and a real number 9q > such that, for all < 9 < 9q: 





Proof: jS], page 468, Lemma 7.7. 



5. Large Deviations 



(1) E(e e ^\F 2i -i) < E(e e «) 

(2) E(e- e( - T2i - 1+T2 ^\J 7 2i - 1 ) < E(e 

Let T n = J2i=o T i- Let 
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Then, VA > Erj/E^, there is a < 7 < 1 such that for all T sufficiently large 



p(±j\(t)dt>x) < 7 T . 



For our application, th will be the time a random geodesic spends outside the 
compact set, and Tn-\ the time inside on the ith 'sojourn'. r\ is a stochastic upper 
bound for T2i, and £ as a stochastic lower bound for the length of a 'cycle'. Thus 
condition 1 should be thought of as a stochastic upper bound for the time spent 
outside and condition 2 a stochastic lower bound for the total time of a cycle. In 
our application, we will give a stronger, deterministic lower bound: in fact, we will 
construct our variables so that r 2 i-i > C, for some fixed C. This clearly implies 
condition 2, simply by taking £ = C. For condition 1, we will use Theorem 12. II to 
show that T2i cannot grow too large. 

Proof: 

Define f{6) := E(e e 'i) and g(6) := E{e~ e ^). 
Let N(T) = sup v{k : T 2k < T}. Then, 

P ^ X{t)dt > At| < P {j o x (t)dt > AT and N(T) < ct| + P (N(T) > cT) 

/VcT/2\ \ 

(5.2) = pi t 2i > XT \ +P (N(T) > cT) . 

We estimate each of these terms in turn. Let n := \cT/2\. Then, since 

/[cT/2\ \ /[cT/2\ \ /[cT/2\ 

p I J2 T2i > x ' n J Z T2i > AT j - p ( Z ™ > A '( n + x ) 

to estimate the first term it suffices to estimate -P(X)™=o T2i > ^' n )i where A' = 
2A/c. Now, for any O > 9 > 

< e~ ex ' n E (e e ^?=° T2i 

(5.3) < e- ex ' n f(8) n . 

The last inequality follows from the first condition in our theorem, and the fact 
that each Tj is T% -measurable. Since equation holds for any 80 > 6 > 0, we have 

P[ Vr 2i >A'n) < inf (f{9)e- ex ' 

Let A' > Erj. Then, letting 

F(0) = f(8)e- ex ' = E (e e ("- A ')) , 

we get F(0) = 1, 

F'{6) = E (e- ^- x '\ri - X 1 )) . 
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This implies that 

F'(0+) = E (n - A') <0. 

Thus, there is a < 9\ < 8 such that F(9i) := 7' < 1. Plugging this into 
equation l|5.3[) yields the estimate for our first term. 

To estimate the second term, let £j = r 2 i-\ + Fix c > l/E^. By a similar 
argument to that above, we obtain 

P(N(T)>cT)< inf e 6T g{0y T . 

Let 

G{6) = g{e)e B ' c = E (V^ 1 ^)) . 
Once again, as above, we obtain 

G'(0+) = -E(£-l/c) < 0. 
Thus, there exists 82 with G(#2) = 7" < 1, and so we have our desired estimate. □ 

Corollary 5.4. With notation as above, 

limsup- / X(t)dt < A 

with probability 1 for all A > En/ E^. 

In order to prove this corollary, we need the following technical lemma: 

Lemma 5.5. Let < 7 < I. Let U : R + — » K + be such that for all sequences 
{ a n}™ =0 w tthY^n=o"f a " convergent, 

lim sup C/(a„) < c, 

n — >oc 

for some c > 0. Then 

lim sup U(T) < c. 

T-»oo 

Proof: We proceed by contradiction. Suppose limsupy^^ U(T) > c. Then, there 
is a sequence of time t ni t n — » 00, such that U(t n ) > c. Take a subsequence t nk , 
where is such that t n > k for all n > n^.. Such a subsequence exists since i n 
diverges. Now, letting = t rik , note that a*; > fc, so 7 afc * s convergent. 

So limsup/,.^^ U(ak) < c. But by definition, U(cik) > c for all k. This is a 
contradiction. □ 

We now proceed with the proof of Corollary 15.41 Let U(T) — ^ X(t)dt. Let 
7 be as in the conclusion of Proposition 15.1 1 Then, for any sequence a n we have 

00 00 
^P(C/(a„)>A)<^ 7 a "- 

n=0 n=a 

Thus, if X^Lo 7°" converges, by the Borel-Cantclli lemma, limsup,^^ U(a n ) < A, 
with probability one. Applying Lemma 15.51 we have our result. □ 
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6. Proofs of main results 



Wc fix the following notation for the rest of this section: fix a 6 > 0, and fix q G Q. 
For h £ SL(2,M), we define Vs : q{h) = Vs(hq). Vs >q is ii'-invariant, and thus can be 
viewed as a function on H 2 . We define the sets Cs,i(q) — {z G H 2 : V q (z) < I}. In 
the rest of this section we work in this H 2 , identifying gtrgq G Q with i.gtrg G H 2 . 
For notational convenience, we drop the subscripts 8 and q, and write V and Ci for 
Vs,q,Cs.i(q). Furthermore, all distances are measured in H 2 . 

Proof of Theorem 12.11 Let q £ Ci. We want to estimate the measure of the sets 



For technical reasons, we will instead study the sets 

B(T) := B(T, I, q) = {9 : 3(p G B'(T, I, q) such that d(i.g T r e , i.g T r<j>) < 3r}, 

where we will specify r shortly. Note that, by definition, and logsmoothness of V, 
there is some a T > 1 such that cj) G B(T,l,q) implies that V(i.gtr§) > l/a T =: I" 
for all < t < T. 

Let B nT = B{nr, I, q), and p nT = v(B nT ). We have 



Our main lemma is as follows 

Lemma 6.1. For all S > 0, and all r sufficiently large there are constants c = 
c(<5), b = b(r, S), so that 



Proof: Note that by definition B nT is a union of arcs of the form I(nr), and as 
such, both Lemma FOl and Corollarv l4. 71 apply (we are also using the fact that V is 
logsmooth). Fix a > 1, and let r be such that we can apply corollary 14.71 Setting 



Let t be large enough so that we can apply Lemma |6. II Proceeding as in the 
proof of Proposition 13 . II part (1), we obtain 



B'(T, I, q) = {6: i.g t rg £C u 0<t< T}. 




D nT < Ce- {1 - S)T + b 



c = ac'c and b = ac'b, we obtain our result. 



□ 




Let t > be such that ce" (1 ~ 5)T ° < 1. 
Let t > tq, and let 



lo > a sup 

t (i <t<2t 



b 



(! _ ce l-«)r)- 



Let 



a = sup a(r), 

to<t<2to 



and set I' 
Let 



l/a. 



5' = 5 + sup — 

to<t<2t t 
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It is easy to check that 8' < 1, and that it is decreasing as a function of I. There is 
some tq < t < 2tq and n £ N such that t = nr. We have 



Rewriting this, we obtain 



□ 



lit) 



Proof of Theorem 12 .21 Let q £ Q. Consider the circle of radius S+T, {i.gs+Tfe : 
< 8 < 2n}. Given T > 0, we want to show that the set B = Bs,t{<1,1) = : 
i.gtfe £ Ci, S < t < S + T} has exponentially small measure in T for sufficiently 
large I, S, and T. 

Given 8q £ -B, let Zq = i.gsrg , and consider the circle {i.gTrcf,gsre : < cf> < 2ir\ 
of radius T around it. By Theorem 12.11 we know that for most (the complement 
is exponentially small in T) directions <p on this circle, i.gtT$gsre £ Ci for some 
t < T. 

Our idea is as follows: there is a small neighborhood U of 9 such that each 
geodesic trajectory {i-gt r e} t =o 1 £ U is closely shadowed by a piecewise geodesic 
of the form 7 = 7e o ,0, 6 = , 3 / s,t(0), where 

i.g t re < t < S 

i-gt-sr<pgsre S <t < S + T 

By closely shadowed, we mean that d{^(t),i.gtro) is small for all t. Thus, if 
V(l(t)) < I for some S < S + T, we have that V(i.g t rg) < I, for some I > I. 

Now, for all but a small set of (j>, j(t) £ C\ for some S < t < S + T. Thus, in a 
small neighborhood of 8 , we have a (large-proportioned) collection of angles which 
are not in B = Bs,r(q, 0- 

To make this rigorous, fix e,6 > 0, and let A := A(0q) = Bq^t{zq,1) = {<fi : 
V(i.gtTj,gsrg ) > 1,0 < t < T}. Let d be the maximum thickness of a hyperbolic 
triangle. By logsmoothness of V, there is a b > 1 such that d(z\ : Z2) < d implies 
< b. Let li — bio, where l is as in the conclusion of Thcorcm l2.ll For any I, 
let l = l/b. 

Setting 5" = 5' (1,8) and I' = l/a = l/ba, where a = a(S) is as in Theorem 12.11 
yields 

v(A) < YM e -a-s")t, 

Let T, S be large enough so that we can apply Lemma 14.31 with e, and ti = T , 
t\ = S. This gives a neighborhood U of 80 in which the proportion of angles 

v(V(A)nU) V(i.g S re) - { i- S ")t 
— <4(l + e) sup e l >. 

v \ u ) ee[o,27r) ' 

Now, by the thinness of triangles in hyperbolic geometry, given 8 £ U, there is a <p 
with ^((f>) = 8 such that 

d(i.g t re,~ie a ,4,(t)) < d 

for t £ [0,S + T]. 

For all <fi £ A, we have 

7<w(*o) £ Cf, 
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for some S + T > t > S. Thus, 

g to r g q G C h 

so 9 <£ B. 

Thus, given any 9 £ B, we have produced a neighborhood U s.t. 

"'^^ 1-4(1 + .) sup i^I£) e -(.-»... 

To complete the proof, we need the following standard lemma (see, for exam- 
pie, 0): 

Lemma 6.2. Let B C [0, 2-7r) 6e a measurable set such that for all b G B, there is 
a 5b > 0, so that Ub = [b — 5b, b + 5b] C [0, 2tt) satisfies 

y(Ub n g) 
K^) < 

v(B) < 2e. 

Applying the lemma to our set B, we obtain 

V{i-9sre) (i-s») t 



v{B) < 8(1 + e) sup 
ee[o,27r) 



□ 



Proof of Theorem 12.31 Our strategy is as follows: Given a direction 9, consider 
the succesive departures and returns of the geodesic trajectory {i.gtre}t>o to the 
compact set C;. Theorem 12 . II implies that the probability any departure is long is 
small, and thus, we can try and apply Proposition 15.11 to the 'random variables' 
given by the length of sojourns inside and outside the compact set. 

We proceed as follows: Let d be as in the proof of Theorem 12. 21 Fix 5 > 0, and 
let Iq be as in Theorem 12.11 Let I > Iq be such that d{Cf,Ci ) > 2d, and define 
C = C{1) = d{Cf, Ci ) - 2d. Define t (9) = 0, and set 

*2n(0) = infO > t 2n -i ■ 30 such that d(i.g t r^,i.g t re) < d,i.g t r^ e Q } 

and 

hn+i(8) = inf{i > hn ■ 3</> such that d(i.gtr^,i.g t re) < d,i.g t r<f, Q}, 
for n > 0. Define Tj(#) = t% — h-i. Now fix C' > 0, and define auxiliary functions 

r 2l < a 

1~2i T 2l > C" 

and 

T 2 i-1 + T 2i T 2i < C 



T 2i -l T 2i > C 

Define t[ := J2]=i T 'y Notc that t 2i + r 2 i_i = r 2i + so t' 2l = t 2l . 

X(t) = < ^ *2i— 1 — * < 

[0 otherwise 

The ij's should be thought of as the entry and departure times of the trajectory 
{i.gtrg}t>o from our compact sets. For technical reasons, they arc defined as the 
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first time when a nearby trajectory leaves a larger compact set (Ci), or re-enters a 
smaller one (Ci ) . The T2i 's measure time spent after departing the larger set before 
returning to the smaller, and the T2i_i's measure the time spent after returning to 
the smaller before departing the larger. The auxiliary r/'s are defined to exclude 
short (< C") sojourns. Thus, if t' 2i < t < t' 2i+1 we arc within distance C of 
the larger compact set, and thus still within a compact set. Thus, for all such t, 
V(i.g t re)<l = l(C'), i.e., 

X{t) > xcs{i-9tre)- 

Thus, it suffices to show that for all A > there is a C > so that v (± J T X(t)dt > A 
decays exponentially in T. 

Wc will apply Proposition l5.1l to {r^}i>o, with il = S 1 , T the standard cr-algcbra, 
and P = v the Haar measure. Let T n = &(t[, . . . ,t' n ) be the cr-algcbra generated 
by t'i, . . . ,t' n . Clearly r' n is JF„-mcasurable. Note that since d(C ; c , C; ) = 2d + C, 

T 2i+1 ^ T 2i+1 = Hi+l — t%i > C 

(since i-gt 2 i r is within distance d of C; and i-gt 2i+ i r e is within distance d of Cf). 
Thus, condition 2 of the proposition is satisfied, with £ = C 
It remains to check condition 1. We will show 

(6.3) v{r' 2l > t\T 2i -i) < aie" a2t 

for some a\, a 2 > and all t > C . 

By definition, t'~ l {x) is an interval, and thus T% is generated by these intervals. 
Fix 6. Let 

I n (9) = {0 : t,-(0) = t'i(e) for all < i < n}. 
It suffices to show that there are ai,a 2 > such that, for all t > C, 

(6.4) v{r' 2n >t\h n -i{0)) <aie- Q2t . 

Fix 4> € l2n~i{6) with T2i(4>) > t. Let z$ = i.gt 2i _ 1 r ( f ) . Note that this point is within 
distance d of the boundary of Cf , thus it is both outside C/ and still contained 
within a compact set. Consider the circle of radius t around and the associated 
map = ^t,t 2 i-i back to the circle at i. 

Let A = A(<f>) = {9 : i.g s i'egt 2 i^ 1 i'4> ^ Ci o ,0 < s < t}. Since is contained in a 
compact set, we can pick c\ = c\(l), C2 = 02(^0) independent of <j) such that 

v{A) < cie- C2t , 

for all t sufficiently large. By picking C large enough, we can get this to hold for 
all t > C. For the rest of this section, let t > C . 

Applying lemma l4~3l with e = 1, we obtain a neighborhood U of <fi, with 

u(U) 

Note that U C X2n-i(&), since VO < t < t 2i -i,0' E U, 

d(i.g t r e/ ,i.g t r e ) < d. 

Finally, observe that T 2i (6>') < t for all 6' E U - * (A(cj))) , since 9' f *(A) implies 
that there is some < s < t such that V '(i-gsT^gt^^^ >) < h, with ^>(ip) = 9'. 
Thus, since d(i.c/ s r^,c/t 2i _ 1 r0, i.g s+t2i _^r0i) < d, we have V^i.^+j.^re') < I, and 
thus T 2 i{0') < t. Once again applying Lemma lb .21 we obtain equation l|6.4|l . with 
a\ = 32ci and a 2 = c 2 . 
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Let C be large enough so that a\e a2 ° < 1. Let 77 be a non-negative function 
on S 1 such that 

v{6 : r,(9) = 0} = ai e- a2C " 

and 

v{6 : r)(6) >t} = a ie - a2t 

for all t > C . Condition 1 is then clearly satisfied, since ^( r 2i = 0) > a\e~ a2C , and 
v(r' 2i > *) < aie~ a ' 2t for t > C. Now, note that E(r)) = ^e~ a2C ' , and E(£) = C. 
Thus, by enlarging C , we can make E(rf) /E(£) arbitrarily small. Precisely for any 
A > 0, take C so that 

E(v)/E(0 < A. 

Then, setting I = 1(C) and applying Proposition 15.11 we obtain that there is a 
7 < 1 so that 



v{6:^\{Q<t<T:g t r e qtC l }\>\}<v\ 




T 



T 



for all T sufficiently large. □ 

Proof of Corollary 12. 41 Applying Corollarv l5.4l to X(t), we obtain our result. □ 
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